Stylistic Experiments in Information Retrieval
Abstract
A discussion of various experiments that use stylistic variation among texts for information retrieval purposes.

1. Stylistics

Texts vary in many ways. Authors make choices when they write a text: they decide how to organize the material they have planned to introduce; they choose between synonyms and between syntactic constructions; they choose an intended audience for the text. Authors make these choices in various ways and for various reasons: based on personal preferences, on their view of the reader, and on what they know and like about other similar texts.

A style is a consistent and distinguishable tendency to make some of these linguistic choices. On a surface level, style is very obviously detectable as the choice between items in a vocabulary, between types of syntactic constructions, and between the various ways a text can be woven from the material it is made of. It is the information carried in a text when compared to other texts, or in a sense compared to language as a whole. This information, if seen or detected by the reader, will impart to the reader a predisposition to understand the meaning of the text in certain ways. Or, more roughly put, style is the difference between two ways of saying the same thing.

So the variation within a text, or the differences between texts, that is not primarily topical and does not have to do with meaning is stylistic. Naturally, demarcation of stylistic variation from topical variation is impossible. Certain meanings must, or tend always to, be expressed in certain styles: legal matters tend to be written in legal jargon rather than hexameter; car owner...
Related resources
Cognitive Characterization of Geographic Objects Based on Spatial Descriptions in Web Resources
This paper examines the effectiveness of geographic information retrieval using partial natural language analysis. In information retrieval, the two most popular methods are term frequencies and co-occurrences. However, these methods do not take grammatical and semantic structure into account, so the information that can be extracted is limited. We propose a method for geographic information retrie...
Stylistic Experiments for Information Retrieval, Jussi Karlgren
Information retrieval systems are built to handle texts as topical items: texts are tabulated by occurrence frequencies of content words in them, under the assumption that text topic is reasonably well modeled by content word occurrence. But texts have several interesting characteristics beyond topic. The experiments described in this text investigate stylistic variation. Roughly put, style is ...
RuThes Linguistic Ontology vs. Russian Wordnets
The paper describes the structure and current state of RuThes, a thesaurus of the Russian language constructed as a linguistic ontology. We compare the structure of RuThes with that of WordNet, and describe the principles for including multiword expressions, the types of relations, and the experiments and applications based on RuThes. For a long time RuThes has been developed within various NLP and information retrie...
Ensuring Stylistic Congruity in Collaboratively Written Text: Requirements Analysis and Design Issues Melanie
ness, concreteness, staticness, or dynamism. The other grammar-based approach to assessing stylistic goals was implemented by Ryan et al. (1992). The stylistic goals of this grammar (with settings in brackets) are as follows: emphasis (emphatic, neutral, flat); clarity (clear, neutral, obscure); and dynamism (dynamic, neutral, static). In this grammar, the basis for evaluating these goals was sem...
Learning Age and Gender of Blogger from Stylistic Variation
We report results on stylistic differences in blogging across gender and age group. The results are based on two mutually independent features. The first feature is the use of slang words, a new concept that we propose for the stylistic study of bloggers. For the second feature, we have analyzed the variation in average sentence length across age groups and genders. These features ar...